On instantaneous vocal tract length estimation from formant frequencies
Abstract
The length of the vocal tract and its relationship with formant frequencies are examined at fine temporal scales, with the goal of accurately estimating vocal tract length (VTL) from acoustics on a spectrum-by-spectrum basis when articulatory information is unknown. Accurate VTL estimation is motivated by applications to speaker normalization and biometrics. The analyses presented are both theoretical and empirical. Various theoretical models are used to predict the behavior of vocal tract resonances in the presence of different vocal tract lengths and constrictions. Real-time MRI (Narayanan et al., 2011) with synchronized audio is also used for detailed measurements of vocal tract length and formant frequencies during running speech, facilitating the examination of short-time changes in vocal tract length and the corresponding changes in formant frequencies, both within and across speakers. Previously proposed methods for estimating vocal tract length are placed within a coherent framework, and their effectiveness is evaluated and compared. A data-driven method for VTL estimation emerges as a natural extension of this framework; it is developed and shown empirically to outperform previous methods on both synthetic and real speech data. A theoretical justification for the effectiveness of this new method is also given.
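To make the link between formant frequencies and vocal tract length concrete, the following is a minimal sketch of the textbook uniform-tube baseline, not the data-driven method the abstract describes. It assumes a lossless tube of uniform cross-section, closed at the glottis and open at the lips, whose resonances are F_n = (2n - 1)c / (4L). The speed of sound (35000 cm/s), the function names, and the choice to average the per-formant estimates are illustrative assumptions and do not come from the paper.

```python
# Uniform-tube (quarter-wavelength) sketch: an assumed textbook baseline,
# not the data-driven estimator described in the abstract.

SPEED_OF_SOUND_CM_S = 35000.0  # assumed value for warm, humid air


def uniform_tube_formants(length_cm, n_formants=4, c=SPEED_OF_SOUND_CM_S):
    """Resonances (Hz) of a uniform tube closed at one end, open at the other."""
    return [(2 * n - 1) * c / (4.0 * length_cm) for n in range(1, n_formants + 1)]


def estimate_vtl_per_formant(formants_hz, c=SPEED_OF_SOUND_CM_S):
    """Invert F_n = (2n - 1) c / (4 L) separately for each formant."""
    return [(2 * n - 1) * c / (4.0 * f) for n, f in enumerate(formants_hz, start=1)]


def estimate_vtl_mean(formants_hz, c=SPEED_OF_SOUND_CM_S):
    """Average the per-formant estimates (a hypothetical, simple way to combine them)."""
    per_formant = estimate_vtl_per_formant(formants_hz, c)
    return sum(per_formant) / len(per_formant)


if __name__ == "__main__":
    formants = uniform_tube_formants(17.5)
    print(formants)
    print(estimate_vtl_mean(formants))
```

For a single spectrum, every formant of an ideal uniform tube yields the same length. In real speech, constrictions shift individual formants, so the per-formant estimates disagree; how to weight them is the kind of question the framework in the abstract addresses.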
Related resources
On Short-Time Estimation of Vocal Tract Length from Formant Frequencies
Vocal tract length is highly variable across speakers and determines many aspects of the acoustic speech signal, making it an essential parameter to consider for explaining behavioral variability. A method for accurate estimation of vocal tract length from formant frequencies would afford normalization of interspeaker variability and facilitate acoustic comparisons across speakers. A framework ...
Speech formant frequency estimation: evaluating a nonstationary analysis method
The objective of this paper is to critically evaluate the performance of a nonstationary analysis method in tracking speech formant frequencies as they change with time due to the natural variations in the vocal-tract system during speech production. The method of instantaneous frequency estimation is applied to the tracking of speech formant frequencies to observe the time variations in the vo...
Tracking of involuntary formant frequency variations and application to parkinsonian speech
The objective of this paper is to present a formant frequency estimation method, developed with a view to track small variations due to involuntary vocal tract movement. The formant frequency estimation is based on the instantaneous frequencies obtained by means of a complex wavelet transform and is synchronised with the glottal cycle. Results for synthetic speech signals show the precision of ...
Correlation between vocal tract length, body height, formant frequencies, and pitch frequency for the five Japanese vowels uttered by fifteen male speakers
We conducted quantitative analyses of a magnetic resonance imaging (MRI) database to examine the correlation between physical measures (vocal tract length and body height) and acoustic parameters (pitch and formant frequencies) of vowels. The vocal tract length was measured from MRI data for the five Japanese vowels produced by fifteen male Japanese speakers between the ages of 24 and 55. The a...
An experiment in vocal tract length estimation
December 13-15, 2007: Firenze, Italy, ed. by C. Manfredi, ISBN 978-88-8453-673-3 (print), ISBN 978-88-8453-674-7 (online) © Firenze University Press, 2007. Abstract: The presentation concerns the estimation of the vocal tract length of a speaker on the basis of her formant frequencies and the formant frequencies and known tract length of a reference speaker. The length prediction is founded on a ...